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METHOD, SYSTEM, AND PROGRAM FOR VERIFYING 



NETWORK ADDRESSES INCLUDED IN A FILE 



BACKGROUND OF THE INVENTION 



5 1. Field of the Invention 

[0001] Provided is a method, system, and program for verifying network addresses included 
in a file. 

2. Description of the Related Art 

1 0 [0002] Oftentimes, Intemet Web addresses, also known as Universal Resource Locators 
(URLs), are embedded in hypertext links within an electronic document, such as a Hypertext 
Markup Language (HTML) page created with a web pubUshing tool, an e-mail created with an 
e-mail text editor, and a document created with a word processing document. The person 
viewing the electronic document may then select the hypertext link to access the page 

1 5 addressed by the URL embedded in the hypertext link. Document authors include hypertext 
links to provide the reader access to other web pages of interest including related information. 
[0003] Currently, the accuracy of the URL included in the hypertext Unk cannot be checked 
by a spell checker program because the URL itself does not form a recognized English 
language word, e.g., "www.ibm.com". Moreover, a document author may believe they have 

20 included the correct URL in a hypertext link in the document, when in fact the URL they 

inserted addresses a web page that is different from the one they intended. For instance, the 
author may insert the URL "www.whitehouse.com" into a document intending to provide a 
hypertext link to the ofl&cial United States government Intemet Web page for the White House, 
when in fact the correct official government Web for the White House is 

25 "www.whitehouse.gov". The author of the web page may be placed in a compromising 

situation if the hypertext Unk included in the document incorrectly references an Intemet Web 




Exp. Mail No. EL821 15741 lUS 
.2- Docket No. ROC920000170US1 

Firm No. 0021.0013 

page including content entirely unrelated to the intended Web page or including offensive or 
objectionable material. For instance, the incorrect URL embedded in the hypertext link may 
reference a web page including objectionable material, such as profane, pornographic, hateful 
or otherwise inappropriate content. If the Web page is directed to a sensitive target audience, 
5 such as children, business associates, potential customers, etc., then such a mistake could be 
particularly troublesome. 

[0004] This same problem may occur if the author is preparing an e-mail to send to business 
associates or other acquaintances and includes a hypertext link to a URL in the e-mail. The e- 
mail author would face the same embarrassment and problems discvissed above by including a 
1 0 wrong hypertext link to a Web page containing material offensive to the e-mail recipients. 

[0005] For these reasons, there is a need in the art for an improved electronic file editing tool 
that assists people in including correct network addresses, such as hypertext links, in the 
electronic file. 

15 SUMMARY OF THE PREFERRED EMBODIMENTS 

[0006] Provided is a method, system, and program for verifying a network address. A 
network address included within a file is accessed. At least one context term is provided. 
Content is accessed at the network address in the file and a determination is made as to 
whether the accessed content satisfies a qualifying threshold with respect to the at least one 

20 context term. 

[0007] Further, the network address may comprise a Universal Resource Locator (URL). In 
such case, accessing the content at the network addresses comprises downloading a file 
addressed by the URL over the Intemet. 

[0008] Moreover, the network address may be embedded in a hypertext link in the file. 
25 [0009] Additionally, the network address may be comprised of a string. In such case, if the 
accessed content does not satisfy the qualifying threshold, then a determination is made of at 




Exp. Mail No. EL821 15741 lUS 
.3. Docket No. ROC920000170US1 

Firm No. 0021.0013 

least one variation of the string, wherein a determined variation of the string is capable of 
comprising a network address at which content is located. For each determined string 
variation, a request is submitted to access content at the network address comprising the string 
variation. If content is accessed in response to the submitted request, then a determination is 
5 made as to whether the accessed content at the network address comprising the string variation 
satisfies the qualifying threshold with respect to the at least one context term, 
[0010] Yet further, if the content accessed fi"om the network address comprising the string 
variation satisfies the qualifying threshold, then the network address comprising the string 
variation is added to a hst of network addresses. User selection of one of the network 
1 0 addresses on the list may be received to substitute for the network address in the file addressing 
content that did not satisfy the qualifying threshold 

[001 1] The described implementations provide a technique for checking the network 
addresses or URLs in a file or page to determine whether the content addressed by the 
network address satisfies some threshold This threshold may comprise the content at the 

1 5 network address including a sufficient number of keywords that are in the file or document 
including the network address. Such determinations provide more robust checking of all the 
content within a file, including any listed network addresses or hypertext hnks. 
[0012] The described implementations may also be used with a viewer program to check 
whether network addresses included in a file being opened by a viewer program includes 

20 content that satisfies a certain threshold, indicating whether the hypertext link is correct. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0013] Referring now to the drawings in which Hke reference numbers represent 
corresponding parts throughout: 
25 FIG. 1 illustrates a network computing environment in which aspects of the invention 

are implemented; 
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FIGs. 2-4 illustrate logic to verify URLs and other network addresses embedded in 
hypertext links listed in a document being authored in accordance with implementations of the 
invention; 

FIG. 5 illustrates a Graphical User Interface (GUI) to allow checking and substitution 
5 of hypertext links in accordance with implementations of the invention. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
[0014] In the following description, reference is made to the accompanying drawings which 
form a part hereof, and which illustrate several embodiments of the present invention. It is 

1 0 understood that other embodiments may be utilized and stmctural and operational changes may 
be made without departing from the scope of the present invention. 
[0015] FIG. 1 illustrates a network environment 2 in which certain aspects of the invention 
are implemented. The network environment 2 includes a computer 4, e.g., a personal 
computer, workstation, server, mainframe, telephony device, handheld device, etc., that is in 

1 5 communication with the Intemet 6. The computer 4 is capable of accessing web servers 8a, b, 
...n which provide Intemet Web pages in response to Hypertext Transfer Protocol (HTTP) 
requests, or a request in any other recognized protocol, e.g.. File Transfer Protocol (FTP), etc. 
The computer 4 includes a computer readable medium 10, which may comprise any volatile or 
non-volatile memory device or storage device known in the art, or some combination thereof ^ 

20 The computer 4 would load programs into the computer readable medium 10 to execute. The 
computer 4 further includes an attached monitor 1 1 capable of rendering graphical user 
interfaces (GUIs) and other information to display. 

[0016] The computer 4 may load and execute a document editor program 12, which is 
capable of generating an electronic document 14 including text content and one or more 
25 hypertext links 16a, b that reference pages, files or content within one of the web servers 8a, 
b,...n. For instance, the document editor program 12 may be part of a Web pubUshing tool to 
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generate HTML web pages, part of an e-mail program for composing and sending e-mail 
messages or part of a word processing program (e.g., Corel WordPerfect, Microsoft Word, 
Lotus Word Pro, etc.)** 

[0017] The document editor program 12 may include a spell checker 18 program that can 
5 determine whether text strings in the document 14 match entries in a dictionaiy of recognized 
words in a manner known in the art. The document editor program 12 further includes a URL 
checker 20 that performs hypertext link verification operations in accordance with the 
described implementations of the invention. 

[0018] In certain implementations, the URL checker 20 would be utilized to verify the 

10 accuracy of hypertext hnks 16a, b based on context terms within the document 14 being edited. 
If the URL checker 20 determines that the context terms in a predetermined vicinity of the 
hypertext link 16a, b, e.g., the same sentence, paragraph, page, etc., are not included at a 
threshold jfrequency within the web page referenced by the hypertext link 16a, b, c, then the 
URL checker 20 assumes that the referenced web page is not the page the author intended In 

1 5 such event, the URL checker 20 generates a URL variation list 22 including modified URLs 
that are variations of the URL in the hypertext hnk 16a, b. For instance, the modified URL in 
the list may have the same domain name as the URL in the hypertext link, e.g., "whitehouse", 
but a different top level domain name, e.g., .com (commercial business), .gov (a government 
agencies), .edu (educational institution), .net (network organizations), .ca (Canada), .th 

20 (Thailand), etc. Additionally, the URL may include possible different spellings of the domain 
name, e.g., whitehouse, and/or URLs including domain names that are morphological variations 
produced by applying a stemming algorithm known in the art to the domain name. From the 
URL variation hst 22, the URL checker program 20 would analyze the web pages referenced 
by the URLs in the URL variation list 22 to determine whether such web pages also satisfy the 

25 qualifying threshold The URL checker 20 program would then generate a set of possible 

correct URLs to include in the possible correct URL list 24, The URLs in the possible correct 
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URL list 24 may then be presented to the user to review and select to substitute for flie URL in 
the hypertext link I6ei, b the user added into tiie document. 

[0019] FIGs. 2, 3, and 4 illustrate logic implemented as code in the URL checker 20 to verify 
the accuracy of hypertext links 16a, b in the document 14 in accordance with certain 
5 implementations of the invention. With respect to FIG. 2, control begins at block 100 with the 
URL checker 20 being invoked. The URL checker 20 may be invoked when the user invokes 
the spell checker 18, or separately invoked through a user command or user selection of an 
item from a drop down menu. Still further, the URL checker 20 may be invoked when 
attempting to make the document 14 available, such as when sending the document 14 as an e- 
1 0 mail transmission or loading the document 14 onto a server to make accessible. After being 
invoked, the URL checker 20 scans (at block 102) the document 14 to locate hypertext links 
16a, b including URL addresses. For each located hypertext link 16a, b, the URL checker 
performs the loop at blocks 104 through 116, which may also include performing blocks 150 
through 214. 

1 5 [0020] At block 1 06, the URL checker 20 determines context terms in a predetermined 
vicinity of the hypertext link 16a, b being considered. The vicinity searched for context terms 
may include the current surrounding words, line, paragraph, page, pages, etc. The URL 
checker 20 then transmits (at block 108) an HTTP GET request to a web server 8a, b, c, to 
access the web page referenced in the hypertext link 16a, b. If (at block 1 10) the web page is 

20 received in response to the GET request, then the URL checker 20 scans (at block 1 12) the 

received web page to determine instances of context terms in the web page. The URL checker 
20 then determines (at block 1 14) whether the web page includes a sufficient number of 
instances of context terms to satisfy a predetermined context term threshold or qualifying 
threshold. For instance, the quaUfying threshold may require at least one or more instances of a 

25 certain percentage of the context terms. The qualifying threshold determination at block 1 14 
screens pages of the hypertext links to determine whether they are likely the web page the 
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author of the document 14 intended given the context of surrounding terms in the document 14. 
The premise of the determination at block 1 14 is that a retrieved web page including instances 
of context terms less than the qualifying threshold is likely not relevant to the content of the 
document 14 and, thus, not likely the intended web page. 
5 [0021] If (at block 1 14) the retrieved web page satisfies the qualifying threshold, i.e., includes 
a number of instances of context terms meeting or exceeding the threshold, then the web page 
referenced in the hypertext link 16a, b is likely the page the author of the document 14 
intended, and control proceeds (at block 1 16) back to block 106 to consider any further 
hypertext links 16a5 b located in the document 14. If the web page was not received (at block 

10 110), indicating that the UEUL of the hypertext link may no longer be valid, or if the web page 
does not satisfy (at block 1 14) the qualifying threshold, then control proceeds (at block 118) to 
block 150 in FIG. 3 to attempt to automatically generate a set of possible correct URLs the 
author intended. The logic of FIGs. 3 and 4 is performed for each hypertext Unk 1 6a, b in the 
document 14 that does not address a valid web page or does not satisfy the qualifying 

15 threshold. 

[0022] With respect to FIG. 3, the URL checker 20 generates (at block 150) modified 
URLs that each include the domain name in the URL of the hypertext link 16a, b and a top level 
domain (TLD), e.g., .com, .edu, .gov, etc., that is different than the top level domain embedded 
in the ULR of the hypertext link 16a, b. Each of these modified URLs including tiie same 
20 domain name and different top level domains are added (at block 152) to the URL variation Ust 



[0023] From blocks 154 to 160, the URL checker 20 generates modified URLs based on 
alternative spellings of the domain name. At block 154, the URL checker 20 calls the spell 
checker 18 to parse and spell check the URL domain name in the hypertext link 16a, b in a 
25 manner known in the spell checking art as a single term and/or as compound words to generate 
possible correct spellings of the domain name and'or the compound terms that comprise the 



22. 
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domain name. A set of possible correct spellings of the domain name is generated (at block 
156), which may include different possible correct spellings of the entire domain name or 
different combinations of the generated correct spellings of the compound terms forming the 
domain name. The URL checker 20 then generates (at block 158) a modified URL for each 
5 possible correct spelling in the generated set and appends (at block 160) the modified URLs to 
the URL variation list 22. 

[0024] At blocks 162-164, the URL checker 20 generates modified URLs using a stemming 
algorithm known in the art against the domain name and/or compound terms that comprise the 
domain name. At block 162, the URL checker 20 executes a stemming algorithm known in the 

10 art on the entire domain name and any compound terms that form the domain name. A 

modified URL is generated (at block 164) for each determined morphological variation of the 
domain name and/or different combinations of the morphological variations of the compound 
terms that form the domain name. These generated modified URLs including the 
morphological variations of the entire domain name or its compound terms are than appended 

1 5 (at block 1 66) to the URL variation list 22. Control then proceeds (at block 1 68) to block 
200 in FIG. 4 to process the URL variation Ust 22 to determine those modified URLs in the list 
22 hkely to be tiie URL the author intended to include in the hypertext link 16a, b. 
[0025] With respect to FIG. 4, control begins at block 200 where the URL checker 20 
performs a loop of steps at blocks 202 through 210 for each modified URL i in the URL 

20 variation list 22. At block 202, tiie URL 20 checker transmits an HTTP GET request to the 
modified URL z. If (at block 204) the requested web page is received, then the URL checker 
20 scans (at block 206) the received web page to determine instances of the context terms, 
previously determined at block 106 in FIG. 2, in the received web page. If (at block 208) the 
scanned web page does include a sufficient number of instances of context terms to satisfy the 

25 qualifying threshold, which is the same determination made at block 1 14 in FIG. 2, then the 
modified URL i is appended (at block 2 1 0) to the possible correct URL list 24. In additional 



• 
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implementations, the qualifying thresholds at block 208 applied to modified URLs and URLs 
embedded in hypertext links 16a, b at block 1 14 may be different After appending the URL / 
to the possible correct URL Ust 24 or failing to quahfy the web page addressed by the modified 
URL / at blocks 204 or 208, control proceeds (at block 212) back to block 200 to consider 
5 the next modified URL / in the URL variation list 22 until all modified URLs are considered. 
Upon completing the possible correct URL Ust 24, control proceeds (at block 214) to block 
1 16 in FIG. 2 to consider any further hypertext links 16a, b in the document 14. 
[0026] The possible correct URL list 24 resulting from the process in FIGs. 3 and 4 includes 
all URLs determined from the actual URL in the hypertext link 16a, b that address web pages 

1 0 that satisfy the qualifying threshold criteria that disqualified the URL in the hypertext link 1 6a, b. 
The URL checker 20 would generate a URL hst 22 and possible correct URL fist 24 for each 
URL in hypertext links 16a, b that does not include a sufficient number of instances of context 
terms to satisfy the qualifying threshold hi certain implementations, if a possible correct URL 
list 24 generated for a hypertext link 16a, b does not include any modified URLs, then no 

1 5 modified URL could be generated according to the logic of FIGs. 3 and 4 that satisfies the 
qualifying threshold 

[0027] FIG. 5 illustrates a graphical user interface (GUI) 300 the URL checker 20 may 
generate on the monitor 1 1 of the computer 4, named the "URL Check Box". The URL check 
box 300 would be displayed for each hypertext link 16a, b located in the document. FIG. 5 

20 shows the URL check box 300 displayed for the hypertext link 302 "www.whitehouse.com" 
located in the document 304. The located hypertext Unk 302 is displayed in the box 306 in the 
check box 300. AU the modified URLs in the possible correct URL Ust 24 generated for the 
URL in the hypertext link 302 are displayed in the "Replacement URL" box 308. User 
selection of the "View Web Page You Entered" push button 310 would cause the URL checker 

25 20 to retrieve and display the web page at the hypertext Unk 302 to view what the author 
entered into the document 14. User selection of flie "View Replacement Page" push button 
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312 would display the web page at the modified URL the user highlighted or otherwise selected 
in the Replacement URL box 308. Selection of the "Skip All" push button 3 14 would cause 
the URL checker 20 to bypass the current and any future instances of the located hypertext link 
302 in the document 304 and selection of the "Replace All" push button 316 would replace all 
5 instances of the located hypertext Unk 302 in the document 304 with the modified URL the user 
selected in the Replacement URL box 308. Similarly, the skip 318 and replace 320 push 
buttons skip or replace, respectively, the located instance of the hypertext link 302 for which 
the URL check box 300 was generated. 

[0028] In one implementation, the URL checker 20 may scan the document 14 to locate the 

10 first hypertext link 16a, generate the possible correct URL list 22 for that hypertext link 16a 
only, and then display the URL check box 300 for that hypertext link. After the autiior selects 
a particular action in the URL check box 300, the URL checker 20 would then locate the next 
hypertext Unk 16b in the document 14 and again generate the possible correct URL Ust 22 and 
display the URL check box 300 for the next located hypertext link 16b. These actions may 

1 5 then be performed until all hypertext links 16a, b in the document 14 are checked. 

[0029] The above described implementations provide a technique for automatically checking 
hypertext links an author included in an electronic document to determine whether the hypertext 
link references the web page the author intended to include in the document based on the 
context of the document. This URL checker 20 tool would help avoid the situation where an 

20 author inadvertently includes a wrong hypertext link that references web content that is 

unrelated to the content of the document or, worse, offends the target audience of the document 

and embarrasses flie author in the opinion of the target audience. 

[0030] Following are some alternative implementations of the invention. 

[0031] The preferred embodiments may be implemented as a method, apparatus or article of 

25 manufacture using standard progranHning and^or engineering techniques to produce software, 
firmware, hardware, or any combination thereof The term "article of manufacture" as used 
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herein refers to code or logic implemented in hardware logic (e.g., an integrated circuit chip, 
Field Programmable Gate Array (FPGA), Application Specific Integrated Circiiit (ASIC), etc.) 
or a computer readable medium (e.g., magnetic storage medium (e.g., hard disk drives, floppy 
disks,, tape, etc.), optical storage (CD-ROMs, optical disks, etc.), volatile and non-volatile 
5 memory devices (e.g., EEPROMs, ROMs, PROMs, RAMs, DRAMs, SRAMs, firmware, 
programmable logic, etc.). Code in the computer readable medium is accessed and executed 
by a processor. The code in which preferred embodiments are implemented may further be 
accessible through a transmission media or from a file server over a network. In such cases, 
the article of manufacture in which the code is implemented may comprise a transmission media, 

1 0 such as a network transmission Une, wireless transmission media, signals propagating through 
space, radio waves, infrared signals, etc. Of course, those skilled in the art will recognize that 
many modifications may be made to this configuration without departing from the scope of the 
present invention, and that the article of manufacture may comprise any information bearing 
medium known in the art. 

1 5 [0032] In certain described implementations, the URL checker is invoked in response to the 
author actively invoking the URL checker or some other checker, such as a spell checker 18. 
Additionally, the URL checker may be invoked when the author attempts to transmit or load 
the document to make available to the target audience. 

[0033] In certain described implementations, the URL checker is invoked to verify network 
20 addresses included in a document being edited. Alternatively, the URL checker may be 

included in a viewer program, such as an Intemet web browser or e-mail program, to check 
URLs when the target audience selects the document to load into the viewer program to render 
on the display 1 1 or other output means. In this way, the URL checker would alert the reader 
to possible incorrect hypertext hnks and suggest possible correct alternative URLs based on the 
25 domain name of the embedded hypertext Unk and the context of the document 
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[0034] In the described implementations, the URL checker 20 detemiined whether the 
content of the web page addressed by the hypertext link included a threshold number of 
instances of the context terms. In alternative implementations, the URL checker 20 may 
examine metadata or metatags associated with the web page. Metatags comprise the 
5 keywords associated with an Internet web page that are used by Intemet search engines to 
index and locate web pages in response to queries. The qualifying threshold would then be 
applied to the metadata or metatags associated with the web page. 
[0035] In certain described implementations, the context terms were gleaned from the 
document being edited Additionally, the author of the document may manually enter specific 

1 0 context terms to use when qualifying web pages. 

[0036] Still further, the author of the document may provide a Hst of negative context terms. 
In such case, if the hypertext link or generated modified URL addressed a document that 
included such negative context terms at a negative qualifying threshold, then such document 
would not quahfy or satisfy the qualifying threshold. For instance, an author of a web page or 

15 e-mail directed to a sensitive audience, e.g., children, business associates, customers, etc., may 
specify profane and offensive language as negative context terms to enable the URL checker 20 
to alert the author of hypertext links embedded in the document that point to web pages 
including such offensive language. The URL checker 20 would then generate a possible correct 
URL list 24 that would include alternative URLs that are free of such offensive material as 

20 specified in the negative context terms. Moreover, the URL checker 20 may provide a 

predefined list of negative context words including offensive and profane language that the URL 
checker 20 may reference when qualifying web pages. 

[0037] The implementations involving the use of negative context terms may be included in an 
Intemet filtering program to screen web pages based on the content of the web pages 
25 addressed by the hypertext links in the downloaded web page. Such an Intemet filtering 
program including the URL checker would allow parents, employers or others to block the 
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display of web pages including hypertext links to web pages including a sufficient number of the 
negative context terms. This would allow the exclusion of pages whose hypertext hnks include 
offensive content even if the web page including the hypertext link does not itself include 
negative context terms. 

5 [0038] In the described implementations, the network addresses or URLs being checked 
were embedded in hypertext links. In alternative implementations, a Ust of URLs or network 
addresses in a file may be checked even if such network addresses are not embedded in 
hypertext links. 

[0039] The algorithm of FIGs. 2, 3, and 4 described particular steps as occurring in a 
1 0 particular order. However, in altemative implementations, the order of the steps may be 
changed and certain steps removed and added without departing fi-om the scope of the 
invention. Moreover, different steps may be performed to execute the overall operation of the 
algorithm. 

[0040] The GUI implementation described located hypertext links as being bolded and the 
1 5 term to substitute as being highhghted. Alternatively, tiie located hypertext Unks may be 

indicated in other ways, such as displaying the terms in different colors, different fonts, different 
font size, etc. 

[0041] The GUI implementation was described as haviQg particular pushbuttons to cause a 
particular set of actions to occur. Alternatively, the URL check box GUI may include 
20 additional pushbuttons to provide additional functions or combinations of fimctions described 
herein, 

[0042] In the described implementations, the URL checker is used to check URL hypertext 
links to web pages. Additionally, the URL checker may be provided to check hypertext Unks 
including any type of network address known in the art, including non-URL type addresses, 
25 and check any type of content available at the network address identified in the hypertext link. 
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[0043] In the described implementations, the URL checker is used to check content on a 
web page. Additionally, if the URL in the hypertext link addresses content in media formats 
other than text, such as video, audio, three-dimensional graphics, etc., the URL checker may 
include the capabihty to parse and check whether content in such other media formats satisfies 
5 the qualifying threshold 

[0044] Moreover, the document including the hypertext link that is being processed by the 
URL checker may be in any document format known in the art, e.g., HTML, Extensible 
Markup Language (XML), ASCII, bitmap, etc. 

[0045] In the described implementations, the hypertext links are embedded in a document the 
1 0 author intends to transmit to a target audience. Additionally, the hypertext links being checked 
may be included in any type of file, regardless of whether the file is to be transmitted to a target 
audience. 

[0046] The described implementations involved checking hypertext links in a document. 
Additionally, the described implementations may be appUed to verifying network address that 

15 are Usted in the document and not embedded in hypertext links. 

[0047] hi the described implementations, the author used a GUI to review and substitute the 
hypertext links in the document Additionally, the user may review the possible correct URL 
list and then manually change selected hypertext links in the document without using the URL 
check box GUI shown in FIG. 5. 

20 [0048] The foregoing description of the preferred embodiments of the invention has been 
presented for the purposes of illustration and description. It is not intended to be exhaustive or 
to Umit the invention to the precise form disclosed. Many modifications and variations are 
possible in tight of the above teaching. It is intended that the scope of the invention be Umited 
not by this detailed description, but rather by the claims appended hereto. The above 

25 specification, examples and data provide a complete description of the manufacture and use of 
the composition of the invention. Since many embodiments of the invention can be made 
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without departing from the spirit and scope of the invention, the invention resides in the claims 
hereinafter appended 

** WordPerfect is a registered trademark of Corel Corporation; Word Pro is a registered 
trademark of Lotus Development Corporation. 



